fix: adding data specific p-value filters #788

addramir · 2024-09-24T16:31:28Z

✨ Context

Data specific lead p-value CS filters.

🛠 What does this PR implement

🙈 Missing

🚦 Before submitting

Do these changes cover one single feature (one change at a time)?
Did you read the contributor guideline?
Did you make sure to update the documentation with your changes?
Did you make sure there is no commented out code in this PR?
Did you follow conventional commits standards in PR title and commit messages?
Did you make sure the branch is up-to-date with the dev branch?
Did you write any new necessary tests?
Did you make sure the changes pass local tests (make test)?
Did you make sure the changes pass pre-commit rules (e.g poetry run pre-commit run --all-files)?

DSuveges · 2024-09-25T08:44:57Z

src/gentropy/finngen_finemapping_ingestion.py

@@ -37,6 +37,10 @@ def __init__(
            finngen_susie_finemapping_cs_summary_files=finngen_susie_finemapping_cs_summary_files,
        )

+        finngen_finemapping_df = finngen_finemapping_df.validate_lead_pvalue(


Minor stylistic comment, and it is absolutely my preference, but I like to use as few variables as possible:

( # Reading Finngen finemapped dataset and convert it to study locus: FinnGenFinemapping.from_finngen_susie_finemapping( spark=session.spark, finngen_susie_finemapping_snp_files=finngen_susie_finemapping_snp_files, finngen_susie_finemapping_cs_summary_files=finngen_susie_finemapping_cs_summary_files, ) # Flagging sub-significnat loci: .validate_lead_pvalue( pvalue_cutoff=FinngenFinemappingConfig().finngen_finemapping_lead_pvalue_threshold ) # Write the output. .df.write.mode(session.write_mode).parquet( finngen_finemapping_out ) )

Please carefully check, I removed the variables but not sure

DSuveges

I might missing something, but could not found the flagging of the in-house finemapped datasets. Also you mentioned there would be specific p-value cutoff for UKBPPP.

DSuveges · 2024-09-25T12:16:42Z

src/gentropy/config.py

@@ -169,6 +175,7 @@ class FinngenFinemappingConfig(StepConfig):
    _target_: str = (
        "gentropy.finngen_finemapping_ingestion.FinnGenFinemappingIngestionStep"
    )
+    finngen_finemapping_lead_pvalue_threshold: float = 1e-5


There's one thing I'm not sure about. You have added finngen_finemapping_lead_pvalue_threshold to the relevant config, and refer to as pvalue_cutoff=FinngenFinemappingConfig().finngen_finemapping_lead_pvalue_threshold in the step. However, finngen_finemapping_lead_pvalue_threshold it not an argument for FinnGenFinemappingIngestionStep. Is it OK? Would all parameters in the config passed to the step? Would that cause any problem? @project-defiant , what do you think?

OK, double-checked with @project-defiant and @d0choa and all parameters in the setepConfig classes needs to be parameters in the init function of the step.

DSuveges · 2024-09-25T12:22:56Z

src/gentropy/eqtl_catalogue.py

+            EqtlCatalogueFinemapping.from_susie_results(processed_susie_df)
+            # Flagging sub-significnat loci:
+            .validate_lead_pvalue(
+                pvalue_cutoff=EqtlCatalogueConfig().eqtl_lead_pvalue_threshold


Same comment: EqtlCatalogueConfig().eqtl_lead_pvalue_threshold has to be passed as parameter of the step.

DSuveges · 2024-09-25T12:23:36Z

src/gentropy/finngen_finemapping_ingestion.py

+            )
+            # Flagging sub-significnat loci:
+            .validate_lead_pvalue(
+                pvalue_cutoff=FinngenFinemappingConfig().finngen_finemapping_lead_pvalue_threshold


Same comment: FinngenFinemappingConfig().finngen_finemapping_lead_pvalue_threshold has to be an init parameter of the step.

DSuveges · 2024-09-25T12:23:53Z

src/gentropy/pics.py

+            .filter_credible_set(credible_interval=CredibleInterval.IS99)
+            # Flagging sub-significnat loci:
+            .validate_lead_pvalue(
+                pvalue_cutoff=WindowBasedClumpingStepConfig().gwas_significance


Same comment.

DSuveges

Any parameter that are defined in the StepConfigs needs to be defined as parameters of the init methods in the respective classes otherwise the step would fail. The reason is that the step object would be initialised with an unexpected parameter.

addramir · 2024-09-30T10:00:21Z

Any parameter that are defined in the StepConfigs needs to be defined as parameters of the init methods in the respective classes otherwise the step would fail. The reason is that the step object would be initialised with an unexpected parameter.

Fixed. Please check

DSuveges · 2024-09-30T11:36:44Z

Any parameter that are defined in the StepConfigs needs to be defined as parameters of the init methods in the respective classes otherwise the step would fail. The reason is that the step object would be initialised with an unexpected parameter.

Fixed. Please check

@addramir , sorry there's an other one here

DSuveges

This PR as it is now, makes it impossible to parametrise the p-value threshold of the PICS step. IF other steps are called as a stand-alone command, users can provide the custom p-value threshold. However this is not true for PICS step. The applied cutoff is defined by config (WindowBasedClumpingStepConfig().gwas_significance), which value cannot be overridden.

I think this might be fine for now, but we should follow the pattern of other steps where all parameters inside the steps are customisable.

fix: adding data specific fillters

73b97a3

github-actions bot added bug Something isn't working size-S Step labels Sep 24, 2024

addramir marked this pull request as ready for review September 24, 2024 18:45

Merge branch 'dev' into yt_adding_data_specific_fillters

0077370

addramir mentioned this pull request Sep 24, 2024

fix: clean unused study_locus_validation step parameter #786

Merged

addramir requested review from DSuveges and d0choa September 24, 2024 18:46

addramir mentioned this pull request Sep 24, 2024

validate_lead_pvalue in study locus fillters all StydyLoci but it shouldn't opentargets/issues#3529

Closed

Merge branch 'dev' into yt_adding_data_specific_fillters

0f5b669

DSuveges reviewed Sep 25, 2024

View reviewed changes

addramir added 2 commits September 25, 2024 11:43

Merge branch 'dev' into yt_adding_data_specific_fillters

f629e63

fix: removing variables

9d66127

DSuveges reviewed Sep 25, 2024

View reviewed changes

DSuveges requested changes Sep 25, 2024

View reviewed changes

DSuveges and others added 3 commits September 26, 2024 16:51

Merge branch 'dev' into yt_adding_data_specific_fillters

88e4dd6

Merge branch 'dev' into yt_adding_data_specific_fillters

996fe3a

fix: adding options to init

5c57ab4

Merge branch 'dev' into yt_adding_data_specific_fillters

3b95cfd

DSuveges approved these changes Sep 30, 2024

View reviewed changes

DSuveges merged commit 8b253a5 into dev Sep 30, 2024
5 checks passed

DSuveges deleted the yt_adding_data_specific_fillters branch October 1, 2024 11:09

project-defiant mentioned this pull request Oct 4, 2024

fix(validation): add qualityControls column if missing in StudyLocus dataset when perfroming validation #814

Merged

13 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: adding data specific p-value filters #788

fix: adding data specific p-value filters #788

addramir commented Sep 24, 2024

DSuveges Sep 25, 2024

addramir Sep 25, 2024

DSuveges left a comment

DSuveges Sep 25, 2024

DSuveges Sep 25, 2024

DSuveges Sep 25, 2024

DSuveges Sep 25, 2024

DSuveges Sep 25, 2024

DSuveges left a comment

addramir commented Sep 30, 2024

DSuveges commented Sep 30, 2024

DSuveges left a comment

fix: adding data specific p-value filters #788

fix: adding data specific p-value filters #788

Conversation

addramir commented Sep 24, 2024

✨ Context

🛠 What does this PR implement

🙈 Missing

🚦 Before submitting

DSuveges Sep 25, 2024

Choose a reason for hiding this comment

addramir Sep 25, 2024

Choose a reason for hiding this comment

DSuveges left a comment

Choose a reason for hiding this comment

DSuveges Sep 25, 2024

Choose a reason for hiding this comment

DSuveges Sep 25, 2024

Choose a reason for hiding this comment

DSuveges Sep 25, 2024

Choose a reason for hiding this comment

DSuveges Sep 25, 2024

Choose a reason for hiding this comment

DSuveges Sep 25, 2024

Choose a reason for hiding this comment

DSuveges left a comment

Choose a reason for hiding this comment

addramir commented Sep 30, 2024

DSuveges commented Sep 30, 2024

DSuveges left a comment

Choose a reason for hiding this comment